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Abstract 

In this work we study the problem of constructing stochastic processes with a prede- 
termined covariance decay by parameterizing its marginals and a given family of copulas. 
We present several examples to illustrate the theory, including the important Gaussian and 
Euclidean families of copulas. We associate the theory to common applied time series mod- 
els and present a general methodology to estimate a given parameter of interest identifiable 
through the process' covariance decay. To exemplify the proposed methodology, we present 
simple Monte Carlo applications to parameter estimation in time series. The methodology 
is also applied to the S&P500 US stock market index. 

Keywords: Copulas, decay of covariance, dependence structure, parameter estimation. 



1 Introduction 

Let {Xt}'^Q be a weakly stationary process. Consider the widely studied problem of modeling 
the dependence structure among the variables Xt. From a strictly probabilistic/measure theo- 
retical point of view, a stochastic process can be viewed as a measure on the space of sample 
paths. Within this probabilistic point of view, associated to any process, there is a sequence of 
distributions {F^j^Q, and to every pair of random variables {Xr,Xs) from the process, there 
is a copula Cr,s (Nelsen, 2006) associated to it. In a weakly stationary process, the covariance 
between Xf and Xt+h depends only on the lag h, and, according to the copula version of Hoeffd- 



ing's lemma (see Lemma 2.1), this covariance can be calculated by the knowledge of the copula 
and the marginals of Xt and Xt^^ alone. Hence, studying the distributional properties among 
the variables is of vital importance in understanding the process' dependence structure. 

Many classes of models are known to present certain covariance decay to zero as the lag h 
increases. In the class of ARFIMA(p, d, g) processes (Lopes, 2008; Taqqu, 2003), for instance, 
the covariance may decay exponentially fast to zero or very slowly, depending on the parameter 
d. For heteroskedastic models such as ARCH (Engle, 1982) and GARCH (Bollerslev, 1986), the 
process itself is uncorrelated, but the absolute value and/or the square of the process are not 
and often present slow decay of covariance. Estimation of the rate of such decay is generally of 
importance as it contains essential information about the long run structure of the process. 

In this paper we are interested in studying the following problem. For a process {^n}^o 
with absolutely continuous marginal distributions {-Fnj^Q and for a given parametric family 
of copulas {Cejeee satisfying some minor regularity conditions, we investigate how to obtain 
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a given decay of covariance by choosing a parameterization {6n}'^=i and by assuming that the 
copulas related to the pair {Xn, Xn+h) depends only on the lag h and are given by Cg^. We 
are especially interested in the case of slow decay of covariance, typical of long-range dependent 
processes, but the theory presented here is much more general than that and covers any arbitrary 
decay of covariance as well as for both, stationary and non-stationary processes. 

Applying copulas as a tool to construct time series or other complex models have become 
routine. See, for instance, Lee and Long (2009). An interesting theory of Markov process based 
on a "product"' operation for copulas is developed in Darsow et al. (1992) and investigated 
in Lageras (2010). Some methods for constructing short memory time series based on condi- 
tional copulas are discussed in chapter 8 of Joe (1997). The author also discusses methods 
for constructing Markovian processes and short memory time series based on parameterization 
of distributions belonging to a convolution-closed infinitely divisible class. The method pre- 
sented here is completely different from those and does not assume any special properties on the 
marginal distributions other than absolute continuity. 

To illustrate the theory, we present examples including several families of copulas widely used 
in practice. Special attention is given to Gaussian processes. We also present two applications 
of the methodology. The first application is related to estimating a given parameter of interest 
identifiable through the covariance decay in weakly stationary processes. The second one is 
related to simulation of time series with a prescribed covariance decay by parameterizing a 
given family of bidimensional copulas. The methodology is also applied to the S&;P500 stock 
market index data set. 

The paper is organized as follows. In the next section we recall a few concepts and results 
useful in the paper as well as we introduce the idea of the paper by a simple example. In Sections 
3 and 4, we consider the special cases of the Archimedean family of copulas and the Extreme 
Value copulas, respectively. In Section 5 we consider a general theory including arbitrary decay 
of covariance and present a discussion of the important case of Gaussian processes. In Section 
6 we develop two applications based on the results from the previous sections. In the first one 
we develop a general methodology based on the theoretical results of the previous section to 
estimate a given parameter of interest in time series model, identifiable through the covariance 
decay. A Monte Carlo study is presented to exemplify the methodology and assess its finite 
sample performance. In the second one, we apply the theory to the problem of random variate 
generation of time series with a predetermined covariance decay. In Section 7 the methodology is 
applied to the S&;P500 stock market index data set. Conclusions and final remarks are presented 
in Section 8. 

2 PreliminEiries 

In this section we recall a few concepts and results we shall need in what follows. An n- 
dimensional copula is a distribution function defined in the n-dimensional hypcrcubc J", where 
I := [0, 1], and whose marginals are uniformly distributed. The interest in copulas has grown 
enormously in the last 15 years or so, especially because of its flexibility in applications. Copulas 
have been successfully applied and widely spread in several areas. In finances, copulas have been 
applied in major topics such as asset pricing, risk management and credit risk analysis among 
many others (see the book by Cherubini et al., 2004 for details). In hydrology the modeling 
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of rainfalls and storms often employ copulas to describe joint features between variables in the 
models (see, for instance, the recent work of Palynchuk and Guo, 2011 and references therein). In 

econometrics, copulas have been widely employed in constructing multidimensional extensions of 
complex models (sec Lcc and Long, 2009 and references therein). In statistics, copulas have been 
applied in all sort of problems, such as development of dependence measures, modeling, testing, 
just to cite a few (see Mari and Kotz, 2001, Nelsen, 2006 and references therein). Curiously, the 
main result in the theory, the celebrated Sklar's theorem recalled below, dates back to the late 
fifties. 

Theorem 2.1 (Sklar's Theorem). Let Xi, ■ ■ ■ ,Xn be random variables with joint distribution 
function H and marginals -Fi, • • • ,Fn; respectively. Then, there exists a copula C such that. 



If the Fi 's are continuous, then C is unique. Otherwise, C is uniquely determined on Ran(Fi) x 
• • • X Ran(F„). The converse also holds. Furthermore, 



The usefulness of Sklar's Theorem (and of copulas for extension) as a tool for statistical 
analysis and modeling lies on allowing one to deal separately with the joint dependence structure, 
characterized by the copula, and the marginals of a given random vector. This flexibility has 
been extensively explored in the literature. Another important result which shall be frequently 
used here is the copula version of the Hoeffding's lemma. 

Lemma 2.1 (Hoeffding's Lemma). Let X and Y he two continuous random variables with 
marginal distributions F and G, respectively, and copula C. Then, the covariance between X 
and Y is given by 



More details on the theory of copulas can be found in the monographs by Nelsen (2006) and 
Joe (1997). 

In this work, IN denotes the set of natural numbers F := {0, 1, 2, • • • }, while IN* := IN \ {0}. 
For a given set A C M,, A denotes the closure of A and A' denotes the set of all accumulation 
points. For a vector x G M,^, denotes the transpose of x. The measure space behind the 
notion of measurable sets and functions is always assumed (without further mention) to be 
(H", ^(R'^), m) (or some appropriate restriction of it), where ^(K,") denotes the Borel cr-field 
in and m is the Lebesgue measure in R". Recall that a function L : 5 — )■ R, for 5 C R, is 
called slowly varying at a E S' ii L is measurable, limited on a bounded interval and satisfies 
\imx-^a L{cx) / L{x) = 1, for all c > 0. The set of slowly varying functions on the infinity will be 
denoted by 

^ := {L : L is measurable, limited on a bounded interval and lim L{cx)/L{x) = 1, Vc > 0}. 



H{xi,--- ,x„) = C{Fi{xi),--- ,Fn{xn))-, for all (xi, • • • , a:„) e R". 



C{ui,--- ,Un) = H{Ft'\ui),--- ,Fi-^\un)), for all («i,--- ,u„)e7", 

where for a function F, F^~^^ denotes its pseudo-inverse given by F(-^)(x) := inf {u G Ran(F) : 
F{u) > x} and, for a function f, Ran(/) denotes the range of f. 




(2.1) 



We notice that if Li,L2 G Li + L2, L1L2 G ^ and, for a constant A; G R \ {0}, kLi G =5f. 
Moreover, for any (3 G (0, 1) and L G L{n)n~^ 0, fact usually explored in defining the 
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concept of long-range dependence. More details on slowly varying function can be found in 
Bingham et al. (1987). For two functions / and g, we denote f{n) ~ g{n) if f{n)/g{n) — ?• 1, as 
n goes to infinity. For an arbitrary sequence of real numbers {a^lnew*; consider the function 
V'on : (OjOo) — ^ IK- given by il^a^ix) ■= a^^-\, where [•] denotes the ceiling function. For L £ 
as a convention, by writing anL G .=Sf we mean ipa„L G »Sf. We shall also write a„ G .if to 
mean tpa„ G If a„ — )• a 7^ 0, then always an G if. If an — )• 0, then an may or may not 
belong to depending on the particular convergence rate of the sequence to 0. For instance, 
an = l/ln(n) G -Sf, but o„ = 1/n ^ if. 

As for long-range dependence, even though the literature on the matter is vast, there still 
no globally accepted definition for it. The most common definitions and their differences and 
similarities are discussed in section 4 of Taqqu (2003). In this work, we shall adopt the following 
general definition. 

Definition 2.1. We say that a weakly stationary process with finite variance {Xt}teTS presents 
long-range dependence if 

Cov{Xt,Xt+h) ~ L(h)h~^ , as h —?' 00, 
for some (3 G (0, 1) and L G if . 

We start with an example to motivate the ideas we shall develop in the rest of the paper. 

Example 2.1. Consider the Farlie-Gumbel-Morgenstern (FGM, for short) family of copulas 
which consists of parametric copulas of the form 

Ce{u,v) ^uv{l + e{l-u){l-v)), 

for \9\ < 1. As a particular case, we have the independence copula Il{u,v) := uv, for 9 = 0. 
Recall that a distribution function F belongs to the so-called Type I Extreme Value family with 
parameters (a, 6) G IR x (0, 00), denoted by EVI(a, b), if 

F{x) = e^"'^"'"^^" , forallxeR. 

Notice that F' {F~^{u)) = buln{u), for all u £ F Let Fn ~ EVI(a„, b), for {an}nem an arbitrary 
sequence of real numbers and 6 > 0. Consider the FGM family of copulas with parameterization 
0n '■= KQ^n~'^, n G M*, for a > 1, where kq > C{a) and ({a) = Yl'k^i^''^ so-called 
Riemman zeta function. For a fixed n > 2, consider the collection of (2) copulas Cij := Cg^-_.^ 
for i,j G {1, • • • ,n}, i ^ j. Now consider the n-dimensional copula (cf. example 3.2 in Dolati 
and Ubeda-Flores, 2005) 

l<i<j<n k = l 1 = 1 

whose marginals are Cij. Let {Xn}ne¥i be a sequence of random variables such that Xn has 



distribution Fn, for each n G M. For any n > 2, let C„ given in (2.2) be the copula associated to 
{Xq, • • • , Xn-i), which implies that the copula related to (X,., X^) is Crs, r, s G {0, • • • , n — 1} 
and r ^ s. Furthermore, Hoeffding's lemma implies 



Cov{Xt,Xt+h) = 9h (^j^ 



1-^ ^ \^^ln(2)^^^^^ln(2)^^_ 
&ln(ii) J 6^ h'^HQ 



where the second equality follows from formula 4.267.8 in Gradshteyn and Ryzhik (2000). Since 
the construction ( 2.2 ) is valid for any n > 2, Sklar's theorem guarantees the existence of all finite 
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dimensional distribution functions with the marginals, bivariate copulas and n-dimensional cop- 
ulas as specified in the construction above. Therefore, by the Kolmogorov's existence theorem, 
we have just constructed a weakly stationary process {Xn}neTN by reparameterizing a certain 
family of parametric copulas and its marginals. 

The objective of this paper is to derive conditions for which a certain decay of covariance 
can be achieved by simply parameterizing a certain family of copula and its marginal. We shall 



be interested in a more general framework than the one presented in Example 2.1 in the sense 
that the theory covers both, weakly and strongly stationary processes as well as non-stationary 
ones. 

Definition 2.2. Let {Cgjege be a parametric family of bidimensional copula, for C ]R a set 
with non-empty interior. If there exists a sequence {6n}neTN* ^ © and a sequence of absolutely 
continuous distributions {-FnjneiN such that 

— — T ^ dudv ^ L(n)n ' , 2.3) 

for L G ^ and (3 G (0, 1), we say that (Cq, Fn) is compatible with long-range dependence structure. 



Notice that Definition 2.2 can be translated as follows. For a sequence {Fn}n&¥i of absolutely 
continuous distribution functions and for a sequence {Cn}neTN* of copulas, consider {Xn}neK a 
sequence of random variables such that Xn is distributed as Fn, for each n G M, and such that 
the copula associated to (Xo,X„) is C„, for n G M*. This construction is always possible by 
Sklar's theorem (notice that we are not making any joint distributional assumption, other than 



the ones implied to (Xo,X„), n G M*). In this setting. Definition 2.2 is equivalent to ask that 
Cov(Xo, Xn) ~ L{n)n~^ , for L G »Sf and /3 G (0, 1). Also notice that a strongly stationary long- 
range dependent process always satisfies (|2.3|) , but the construction alone is clearly not sufficient 



to specify a stochastic process, in which case we have to proceed analogously to Example 2.1 

To simplify the notation, for a sequence of absolutely continuous distribution functions 
{Fn}neTN and a parametric copula Cg, we shall write 

Lix) := FMr'H^)) and d'^Ceiu,v) := ^^^J^ . 



3 Archimedean Family 



Recall that the Archimedean family of copulas consists of copulas of the form 

C{u,v) = ip~'^{ip{u) + f{v)), 

for some continuous decreasing convex possibly parametric function ip : / — )• [0,oo], I := [0, 1], 
such that 9^(1) = 0, called the Archimedean generator of the copula. We are interested in 
determining conditions in which a sequence of Archimedean copulas together with a sequence of 
absolutely continuous distribution functions is compatible with long-range dependence structure. 

Proposition 3.1. Let {(pe}ee&j /o'" © ^ IR- with non-empty interior, be a family of Archimede- 
an generators and {Cgjege be the correspondent Archimedean family. Suppose that there exists 
a G 0' such that limg^a feii) = — ln(t), where the limit is to be understood as the adequate 
lateral limit if a ^ int(B). Also assume that there exists a set DC© with non-empty interior 
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such that a £ D' and ipg, seen as a function of 0, is of class in D. Let {-FnjnsM be a sequence 
of absolutely continuous distribution functions and define the sequences 

Ki{n) ■— [ [ lim ^ ^ ' \ dudv and K2[n) ■— [ [ lim , , I } dudv. 

JJl'^ e^D ''Omi'n(V) J Jp lo(u)ln{v) 

Let {^n}neM* be a sequence in D converging to a. Suppose that, for some /3 G (0, 1), Ki{n){6n — 
a) ~ Li(n)n~^ and K2(n){9n — a)^ = o{L2{n)n~^), as n goes to infinity, for Li,L2 G Then, 
{CojFn) is compatible with long-range dependence structure. 



Proof. We present the proof for the case where a ^ int(0) and assuming that a > x for ah 
X £ D. The other cases are proved analogously. Let {am}mem* be a sequence of parameters in 
D converging from the left to a. Applying a Taylor's expansion with Lagrange's remainder in 
Ce around 6 = a, we obtain 



Cg{u,v) = lim Ca„,{u,v) + lim dCa^{u,v){9 — a) + - lim 9^Cq,„ (u, w)(0o — a)^ 
— uv + lim dCa,„{u,v){9 — a) + lim d'^Ca^{u,v){6o — a)'^ , 



(3.1) 



where 6*0 G [On, a). Substituting 6 by On in (3.1), we obtain 



1 2 

Cg^^{u,v) — uv + lim dCa„^{u,v){9n — a) + -- lim d'^Ca^^{u,v)(6o{n) — a) , (3-2) 

where ^o(^) £ [^njo) depends on n and satisfies lim„_s.oo ^ol?^) = o,- Under the notation of the 
enunciate, we obtain 



/2 



Ce,, (u, w) — uv 
lo{u)ln{v) 



dudv 



lim ctMCtv 

/2 m^oo lQ{u)ln[V) 



a 



lim / ^ / ' \ dwdf I (OoM — a) 
j2 m^oo /o(w)«„(i') J 



= ifi(7i)(0„ - a) + ^i^2(n)(0o(«) -a)\ 



(3.3) 



Since 0o(^) £ [^ma), 



|if2(n)|(eo(n)-a)' < |X2(n)(0„-a)'| =o(i2(n)n-'3). 



(3.4) 



by the hypothesis on K2. The result now follows from (3.3) in view of (3.4) and the hypothesis 
on Ki. m 



Example 3.1. The Ali-Mikhail-Haq (AMH, for short) family of copulas is Archimedean with 
generator (^e(t) = (1 - 9)'^ In ([1 + e{t - l)]t"^), for \0\ < 1. The copulas of the AMH family 
have the form 

^ . N UV 

1 — ^(l — u)(l — V) 

As a particular case, we have Co = H. Let {i^njneM be a sequence of distribution functions such 
that Fn ~ Exp(A„), for {Xn}n£K a sequence of positive real numbers. Simple calculations show 
that Ini^) = (1 — x)/Xn and 

Ki(n) — XqXti ( j udu\ — ^ " and K2{n) — 2AoA„ ( j u{l — u)du\ ~ ~Yg~~' (3-5) 
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From (3.5), there are many ways to take A„ and On in order to obtain the desired decay. For 

-/3 



instance, for /3 G (0, 1) and L € by taking A„ G ^ and 



~ n 



or A, 



~ n 



^ and 



On ^ but such that On G -2', then Proposition 3.1 apphes and we conclude that {Cg,Fn) is 
compatible with long-range dependence structure. If A„ = Aq > 0, for all n, as it is the case for 
a strongly stationary process, then by taking On ~ L{n)n~^ , the same conclusion is obtained 



through Proposition 3.1 



Example 3.2. The Gumbel-Barnett family of copulas is Archimedean with generator (pe{x) = 
ln(l — 01n(x)), for G (0, 1] and copulas given by 

Cg{u,v) — exp(— 6* ln(w) ln(u)). 

As a limiting case we have limg^o+ Ce = H. Let F„ be distributed as EVI(a„,6,i), n G IN, for 
{onjngM an arbitrary sequence of real numbers and {6n}neM ^ sequence of positive real numbers. 
A simple calculation shows that 

Therefore, the situation is similar to Example 3.1 For instance, for f3 G (0, 1) and L G 
taking 1/bn G ^ and On ~ n~^, or bn ~ and 0„ — >• but such that On G then Proposition 
|3.1| applies and we conclude that {Cg,Fn) is compatible with long-range dependence structure. 
If 1/bn = ko > for all n, as it is the case for a strongly stationary process, then taking 



On ~ L{n)n ^ , the same conclusion is obtained through Proposition 3.1 



Example 3.3. Consider the Frank family of copulas with Archimedean generator (fe^x) 



ln((e 



-et 



l)(e^ — 1) for G IR \ {0}, with corresponding copula given by 



Ceiu, v) 



In 1 



-du 



l)(e 



-0v 



This is one of the most applied Archimedean copula in statistics. Details of its nice properties 
can be found in Nelsen (2006) and references therein. As a particular case we have Co = H. Let 
Fn be distributed as EVI(a„, n G M, for {anjngiN an arbitrary sequence of real numbers and 
{bn}n£TN & Sequence of positive real numbers. Upon applying formula 4.267.8 in Gradshteyn and 
Ryzhik (2000), it is routine to show that 



Ki{n) 



where 



1 



1 - u 
ln{u 



du 



ln(2)2 
2bobn 



and K2 (n) 



p{u,v) 



-u'^v^ 



2u-'v 



3„3 



UV 



P{u,v) 
bou \Ti{u)bnV \n{v) 



3u^v 



2, ,2 



uv 

IT 



dudv 



fcp 
bobn 



(3.6) 



and ko = 2 ln(3)V3- 2 ln(2) ln(3) + 3 ln(2)V2. In view of for (3 G (0,1) and L G ^, taking 

l/6„ G ^ and On ^ n or 6„ ~ and On — ^ but such that On G then by Proposition 3.1 
we conclude that {Cg,Fn) is compatible with long-range dependence structure. If 1/bn = ko > 
for all n, as it is the case for a strongly stationary process, then taking On ~ L{n)n~^ , the same 
conclusion holds. 
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4 Extreme Value Copulas 

Recall that C is an Extreme Value Copula (EVC, for short) if there exists a copula Cq such that 

C{u,v) = lim CS{u^,vi), 

for all {u,v) G I^. A method to "parameterize" the EVC family is devised in Pickands (1981) 
(see also Nelsen, 2006, p. 97). The construction show that a copula C belongs to the EVC family 
if it can be written as 

Ca{u,v) = exp( ln(ut;)A( ^^^^^ 



hi{uv) ^ 

for some possibly parametric convex function A : I ^ [0.5,1], called dependence function, 
satisfying ^(0) = ^(1) = 1 and A{t) £ [max{t, 1 — t}, 1], for all t £ I. As a particular case, when 
A = 1, Ca = n. For a parametric dependence function Aq, we shall denote the corresponding 
EVC family by Ce := Caq- Under some conditions, the EVC family is also compatible with 
long-range dependence structure. 

To simplify the notation, let 



Ae{u,v) := Ag\ — — - , and dAe{u,v 



\\a{uv) / ' ' do 



for k = 1,2. Proposition 4.1 below presents conditions similar to Proposition 3.1 to obtain the 



compatibility of a EVC family with long-range dependence structure. 

Proposition 4.1. Let {Agjege) for OCR with non-empty interior, be a family of dependence 
functions and let {Cej^ge be the associated EVC family. Suppose that there exists a £ Q' such 
that lim£i_^a Ag = 1 where the limit is to be understood as the adequate lateral limit if a ^ lnt(0). 
Also assume that there exists a set D C with non-empty interior such that a £ D' and ipg, 
seen as a function of 9, is of class in D. Let {-FnjneiN a sequence of absolutely continuous 
distribution functions and define the sequences 

Klin) ■— [ [ lim , , \ dudv and K2{n) ■— [ [ lim , , I } dudv. 

J J 12 e-^a lo{u)ln{v) J J 12 e^-^a lo{u)ln{v) 

Let {^rajneM* be a sequence in D converging to a. Suppose that, for some /3 £ (0, 1), Ki{n){6n — 
a) ~ Li(n)n~^ and K2(n){9n — a)^ = o{L2{n)n~^), as n goes to infinity, for Li,L2 £ ^ . Then, 
{CojFn) is compatible with long-range dependence structure. 



Proof. The proof goes along the same lines as the proof of Proposition 3.1 



It is easy to see that equivalent expressions to Ki and K2 in Proposition 4.1 are as follows: 

uv In(uu) 



Klin) 



12 



lim dAe{u, v) 



lQ{u)ln{v) 



dudv 



and 



K2{n) 



12 



In(uw) 



1 2 



lim dAg{u, v) 



lim d Ae{u, v) 



UV In(uw) 

la{u)ln{v) 



dudv. 



(4.1) 



(4.2) 



The above expressions seem cumbersome at first glance, they are often simpler to calculate 
because the limits inside the integrals usually result in simple expressions. 
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Example 4.1. Consider the following dependence function and the associated EVC family 
Ag{t) = 1 — 9t{l — t) and Cg{u,v) = uvcxp 



6'ln(w)(ln(w) -ln(ww)) 
ln(wi;) 



for 6 £ I. This is known as the generator of the Tawn mixed model (cf. Mari and Kotz, 2001, 
p.96). Notice that Ao{t) = 1, dAo{t) = -t{l - t) and d'^Ao{t) = 0. Let F„ be distributed as 
EVI(a„, bn), n E IN, for {a„}„g]N an arbitrary sequence of real numbers and {hn}n&M a sequence 
of positive real numbers. By using (4.1) and (4.2), 

Ki{n) = 



\n{v)(\n{v) — ln{uv)) 1 
- dudv = — 



/2 



bo bn \n{uvy 



bo bn 



ln(u) ln(t;) 
(ln(M) +ln(i;))' 



dudv ~ — 



6bo bn' 



and 



K2{n) 



(ln(z;)( ln(u) — In(uw)))^ 
bo bn ln(w)3 



dudv 



\n{u) ln{v) 



bob„ J Jp (ln(u) +ln(w))' 



dudv 



1 



15 6o K 



where the integrals in both expressions are calculated by changing variables to u = e *, by 
applying integral by parts and by using formula 3.353.1 in Gradshteyn and Ryzhik (2000). 
Therefore, for 13 £ (0, 1) and L £ ^, taking l/6„ G ^ and On n ^ J OT bn and 9n — ^ 

but such that 9n £ ■££ ^ then Proposition |4. 1| applies and we conclude that (Cg, F„) is compatible 
with long-range dependence structure. If 1/6^ = /cq > for all n, by taking Qn ~ L{n)n~^ the 
same conclusion holds. 



5 General Theory 

In the previous sections we studied two particular cases of a more general theory to be developed 
in this section. The focus of the previous section was on the compatibility with long-range 



dependence structure. However, as it shall become clear in Theorem 5.1 below, with a minimum 
effort we can extend the theory to cover any arbitrary decay of covariance in a wide class of 
parametric families of absolutely continuous copulas satisfying minimal regularity conditions. 

Theorem 5.1. Let {C^jege be a family of parametric copulas, for G C R with non-empty 
interior. Suppose that there exists a point a £ Q' such that limg^a C'e = II, where the limit is 
to be understood as the adequate lateral limit if a ^ int(0). Also assume that there exist a set 
D Q Q with non-empty interior such that a £ D' and Cg, seen as a function of the parameter 6, 
is of class in D. Let {-FnjnelN be a sequence of absolutely continuous distribution functions 
and define the sequences 

Ki(n) ■= [ [ lim , , ' \ dudv and K2(n) ■= f f lim , , ^[ [ } dudv. 

J J P- e^-^a lo{u)ln{v) J J 12 e-^a lo{u)ln{v) 

Let {9n}neK* be a sequence in D converging to a and let {Xn}n£Vi be a sequence of random 
variables such that Xn ~ Fn, n G IN and the copula associated with (Xo,X„) be Cg^. Given 
a measurable function i? : IR — )• R satisfying lim^^oo R{n) = 0, a sufficient condition for 
Cov(Xo, Xn) ~ R{n), as n tends to infinity, is that Ki{n){6n — a) ~ R{n) and K2{n){dn — a)^ = 
o{R{n)). 



Proof. We present the proof for the case where a ^ int(0) and assuming that a > x for all 
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X £ D. The other cases are dealt analogously. Let {am}m£TN* be an arbitrary sequence of 
parameters in D converging from the left to a. Applying a Taylor's expansion with Lagrange's 
remainder in Cq around 6 = a, and proceeding as in the proof of Proposition 3.1, we obtain 

1 2 

Cg^^{u,v) — uv + lini dCa^^^{u,v){6„ — a) + - lim d'^Ca^^{u,v)(6o{n) — a) , (5-1) 

with the notation as in the enunciate. By Hoeffding's lemma it follows that 

JJj2 lo{u)ln{v) \JJj2m^oc la(U)ln(V) ) 

= Ki{n){e^ - a) + ii^2(n)(0oH - of, 

where 9Q{n) £ [On, a) depends on n and satisfies lim„_i.oo Ooin) = a, provided the integrals exist. 
By hypothesis 

\K2{n)\{d^{n) ^ af < \K2{n){9„ ^ af\ ^ o{R{n)), 
and the result follows from the hypothesis on Ki{n). ■ 



Remark 5.1. We observe that calculating Ki and K2 in closed form may be cumbersome. In 
practice, however, numerical integration can be applied with a high degree of precision, since 
Ki and K2 are usually very smooth functions, when they exist. When they do not exist, the 
theory does not apply, but the method of direct parameterization as in Example 2.1 may be 
used instead, although finding a suitable parameterization in this case may be a non-trivial task. 

Example 5.1 {Covariance Decay on Gaussian Processes). Let 0, <I> and 'I'"^ denote the den- 
sity, the distribution function and the quantile function of a standard normal random variable, 
respectively. Also let $p denote the distribution function of a bivariate normal distribution with 
mean (0, 0)"^ and variance-covariance matrix given by O := (pi)- The so-called Gaussian family 
of copulas comprehend the copulas given by 

Cp{u,v)^^p{^-\u),^-\v)), 

for p G [—1, 1]. As a particular case we have Co = H. Let Fn = ^, for all n G M. For simplicity, 
for u G /, we shall denote qu ■= ^^^{u). The chain rule yields 



dCp{u,v) _ pCp{u,v) 



dp 



27r(l-p)i 27rVr 



1 -p2 



dxdy, 



where S := S{u,v) = {— 00, Qu] x (— oo,g„]. Thus, 



lim 

p->-0 



dCp{u,v) 

dp 



1 

2^ 



ye 



~dy 



ye ^ dy] ^ 0(9„)0((?„), 



where the last equality follows from integration by parts. Hence Ki{n) = 1. We move to 
calculate K2- Elementary calculus yields 



d^Cp{u,v) 

lim = uv 

P-s-o dp^ 



uv ■ 



- y^)(t){x)(j){y)dxdy 



uv - quqv(f'iqu)(f>{qv) = quqv(f'iqii)(f>iqv) 
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Now, since F„ ~ M{0, 1), 

2 / „ X 2 



where the second equahty foUows by changing variables io y = Qu- Therefore, if {Xn}neTN is a 
sequence of standard normal random variables and the copula associated with {XQ,Xn) is Cp„, 
the calculations above show that the parameterization we choose for the sequence {pnjneiN* will 
ultimately determine the decay of covariance (and the decay of correlation for that matter, since 
Xn ~ A/'(0, 1)) of {Xo,Xn)- Consider the strongly stationary Gaussian process {X„}„g]N where 
Xn ~ AA(0, 1) and the copula of {Xr,Xs) is Cp^^_^^ (such a construction is always possible for 
Gaussian processes, since all finite dimensional copulas can be taken n-dimensional Gaussian 
copulas with the appropriate covariance structure). Some examples are as follows. 

• For g G M*, and {'&n}n=o, the parameterization 

^ 9-|"l 



1 j=0 



determines a Gaussian MA(q) process (cf. Brockwell and Davis, 1991, p. 78 with the 
adequate adaptations). 

For \ip\ < 1, the parameterization p„ = ip^'^^ determines a Gaussian AR(1) process (as 
noted in Joe, 1997). 

The parameterization pn = 2~"(1 + 0.75n), for all n G M*, determines a Gaussian 
ARMA(2, 1) process defined by (1 - S + 0.25B'^)Xt = (1 + B)Zt, t G IN, for {Zt}teTN 
i.i.d. A/'(0,32/3), where B denotes the backward shift operator (cf. Brockwell and Davis, 
1991, p. 92 with the adequate adaptations). 

For d G (—0.5,0.5), the parameterization 

for n G M*, determines a Gaussian ARFIMA(0, d, 0) process (see Lopes, 2008), which is a 
long-range dependent process. 

For {cfcjfcgfj a sequence of real numbers satisfying XlfceiN '^fc ^ parameterization 
Pn = ^Jg" '^j^j+H (iQiQTi.Tu^[T[^Qg a general Gaussian linear process with coefficients {ck}ki^K- 



Remark 5.2. We notice that in Example 5.1, we assume that Xn ~ AA(0, 1). But in the usual 
definition of an ARFIMA(0, d, 0) process with standard normal innovations, for instance, the 
marginals are distributed as AA(0, ^^gj^j c|) , where the sequence {ck}k£]N are the coefficients 
of the MA(oo) representation of the process. So that, in practice, one has to be careful to 
distinguish the meaning of the parameter p„, which can be misleading. 



The general framework of Theorem 5.1 can be extended to cover the case of parametric 
families of copulas for which the parameter space C Mf', k G W*. Let {Co}e&e)^ © ^ be a 
family of copulas for which Ce is twice continuously differentiable with respect to 6 on an open 
neighborhood U C B of a point a = (ai, • • • , a^)"'" G int(0). Recall that the differential of Ce 
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with respect to at a G M,^ is the hnear functional deCaiu, v) :M.^ ^ M. whose value at a point 



Ji,--- ,bkV gR'' is 



deCaiu, v) -b^^ — Ceiu, v)b. 



The second differential of Cq with respect to at a G applied to 6 = • • • , bk)'^ G M,^ is 
given by 



dlCa{u,v)-b^ = 



d0ide. 



Ce{u, v)bibj 



With this formulation, in the following Theorem 5.2 we consider general decay of covariance in 
parametric families of copulas for which the space of parameter is a subset of ^ , k >2, allowing 
for some of the parameters to remain fixed. 

Theorem 5.2. Let {Cgjege, for Q Q R'^+^ with non-empty interior, k £ M* and s £ K, be a 
family of parametric copulas for which there exists a € Q' such that limg^aCo{u,v) = uv, for 
all u,v £ I. The limit is to be understood as the coordinatewise adequate lateral limits in case 
a ^ int(0), also allowing for s coordinates to remain fixed, that is, we allow for 

9 = (6*1, • • • ,0fe,6'"_|_i, • • • ,9k+s) ^ ■ ■ • j«A;,^'fc+l, ■ ■ • ,(^k+s) = O'- 

Also assume that there exists a set D (1 Q with non-empty interior such that a £ D' and Cg is 
twice continuously differentiable with respect to {^i,--- ,9^} in D. Let {-FnjneM be a sequence 
of absolutely continuous distribution functions and define the sequences 



and 



1 



lim — — ^ dudv, i 



j2 /o(u)/„(w) e^a dOi 

d^Cg{u,v) 

lim 



1 



dudv, i, j — 1, ■ ■ ■ ,k. 



ij2 loiu)lniv) S^a dOidOj 

Let {^njnGiN* be a sequence in D converging to a, with possibly s fixed coordinates and let 
{Xn}neK be a sequence of random variables such that Xn ~ F^, n G M, and the copula associated 
with (^XQ,Xn) is Con- Given a measurable function i? : 11 — Hi satisfying lim^_j.oo R{n) = 0, 
suppose that 

k k 

Y,K't\n){\e^^ - a,\) ^ R{n) and ^ - a.|) - a,|) = o(i?(n)). (5.3) 

i—l — l 

Then, Cov(Xo,X„) ~ R{n) as n goes to infinity. 



Proof. We present the proof for the case where a ^ int(0). The other cases are dealt analogously. 
Let {oLm}m<^w* be an arbitrary sequence of parameters in D such that cxm o, (assuming the 
adequate lateral limit when necessary, allowing for s coordinates to remain fixed). Applying 
Taylor's formula with Lagrange's remainder in (^i,--- ,9^)'^ around (ai,-- - ,afc)"'" (the other 
parameters are fixed), there exists Oq = {9l^\ • • • , 9^^^)'^ such that \9q^ — ai\ < \9i — ai\, for all 
i = 1, - ■ ■ ,k, and 



Cg{u,v) = lim Cq,^(u, w) + lim d0Ca^{u,v){\O — a\) + - lira dgCaJiu,v){6o — a)^ 



uv ■ 



Elim 
m — ^oc 



'dCe{u,v) 






0=a„,. 



{\0^-a,\) + 
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1 

2 ^ m—^oo 



d^Co{u,v) 



de,de.j 



(f) 



(5.4) 



Substituting 0„ as in the enunciate in (5.4), we have 



Ce„ (m, u) = + > lim 

' ^ rr?, — ^nr 



'dCg{u,v) 




[ do. 





1 

- ^ lim 



d''Ce{u,v) 



dOidOj 



(|0«(n)-a.|)(|C(n)-a,|), (5.5) 



where OQ\n) G ( minj^n'', a,}, max{^i*\ Oj}) U {6n^}, for i = I,-- - ,A;. Let {Xn\n&i and 
{F„}„g]N be as in the enunciate. Hoeffding's lemma combined with (5.5) yields 



Cov(Xo,X„) = ^ 



1 



lim 



dCe{u,v) 



j2 lo{u)lniv) m-s-oo d9i 
k 



dudv 



lim 



j2 loiu)ln{v) m-i-oo dOidOj 



(|0«(n)-a,|)(|0(^")(n)-a, 



dudv 



i.3 = i 



R{n) + o{R{n)) - i?(n), 



by the hypothesis on k'^'^ and k'^'^'^ ■ 



Conditions (5.3) are general ones. Observe that if there exists € {!,••• , fc} such that 

Kf°\n){\0^n"^ - a.ol) ^ and iff^(|^^« - a,|) = 0{R{n)), for all z ^ zq, 

and 

=o(i?(n)), for alH,j = l,--- , fc, * < j. 



then the conclusion of Theorem |5.2| holds. These conditions, which imply (5.3), are usually 
simpler to verify. 

Recall that the Frechet-Hoeffding lower bound copula and upper bound copulas are given 
respectively by, W{u, v) = maxju + w — 1, 0} and M{u, v) = minjti, v}. The importance of the 
copulas W and M relies in two facts. First, for any copula C, W{u,v) < C{u,v) < M(u,v), for 
all u,v £ I. Second, the copula related to the random vector (X, h{X)^ is W if, and only if, h is 
decreasing and it is M if, and only if, h is increasing. A family of copulas for which the copulas 
n, W and M are particular (or limiting) cases is said to be comprehensive. 

Comprehensiveness is a highly desirable property for a family of copulas because this means 
that the family can model a broad range of dependence structures. The Frank copula of Example 
|3.3|and the Gaussian copula of Example 1 5 . 1 1 are examples of comprehensive families. The FGM 



family from Example 2.1 is not, but due to the simple analytical form of the copulas in this 
family, it is widely employed in the literature in modeling, in testing association and in studying 
efficiency of nonparametric procedures (cf. Nelsen, 2006, p. 78). However, this family is arguably 
too restrictive for most applications as, for instance, the Kendall's r dependence coefficient for 
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this family ranges in [—2/9,2/9]. Since the space of ah bidimensional copulas is a convex 
space, a simple solution for this problem is to consider a new copula obtained from the convex 
combination of the FGM copula and another copula(s) presenting the desired complementary 
characteristics. 



Example 5.2. Let {C^}^g[_i^i] denote the FGM family of Example 2.1 and let {Cs}s£[i,oo) be 
the Euclidean family of copulas with generator ifs{t) = (1 — t)^. This family comprehend the 
copulas of the form 

Cs{u,v) :=niax{l- [(1 - u)'' + (1 - i;)*]^ O}, for S > 1. 



Particular cases of this family are Ci = W and Coo 
comprehensive family of copulas by setting 



M . Define a new three-parameter 



C6i(m, v) — aCj{u, v) + {1 — a)Cs{u, v), 



(5.6) 



where 6 := (7, a, (5)"^ e [-1,1] x [0,1] x [l,oo). Notice that Cq = U when 6 = (0,1,5)"^, for 
all S G [l,c«), so that, in the notation of Theorem 5.2, a := (0,1,5)"'". In order to exemplify 
the use of Theorem 5.2, we shall analyze the compatibility of the family (5.6) with long-range 
dependence. For simplicity, let us fix (^0 = 1 and consider the triple 6 = (7, a, (5o)"'". Consider 
the family of triangular distribution functions in [a,b], denoted by Tr(a, 6), whose distribution 
function and density are given, respectively, by 



F{x; a, h) 



b — a 



and f{x] a, b) 



2{x - a) 
{h-aY ' 



for all X G [a, b\. For two bounded sequences of real numbers {anlnsM and {^nlnew, with 6„ > a„ 
for all n € IN, let {FnjneiN be a sequence of distribution functions such that Fn is distributed as 



Tr(a„, &„), for each n S M. In this case ln{x) = ■ Let us denote d-^Ceiu^v) : 



dCg{u,v) 
97 



and 

similarly for the derivative with respect to a. The first derivative of Ce with respect to 7 and 
a are 

d^Cg{u,v) — auv{l — u){l — v) and daCe{u,v) ~ C^{u,v) — Csg{u,v), 

so that 



and 



y/u{l - u)du I {bo - ao)(&„ - a„) 
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2 (bo - ao)(6„ - an) 



1 



J y/u duj — 



W{u,v) 



dudv 



{bo ~ ao){bn - a„) = fco(&o - ao)(&n - an), 



where k\ 



= 9 



As for the second derivative, since Ce is a linear function of (7, a). 



d^Ce{u,v) d^Co{u,v) ^ , dlMu,v) ,,,,,, 
— =0, and „ „ — uvln{u)\n{vj, 



so that 



9 



{bo - ao)(6„ - ttn) 



d^da 



and 



K. 



(1.1) 



(n) ^ -^V) = 0. 



Therefore, if we choose bn — an ^ -2', with bn — an ^ 0, 7^ ~ L{n)n~^ and an — 1 ~ o{n~^) (or 
vice-versa), for f3 £ (0, 1) and L £ then {Ce-,Fn) is compatible with long-range dependence. 
If we take a„ = oq and 6„ = 60, ao < ^o, instead, we still obtain the result. 
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Example 5.3. Recall that the Archimax family of copulas (Caperaa et al., 2000; Marl and 
Kotz, 2001) comprehend copulas of the form 



C^,a{u, v) = ip M {ip{u) + f{v))A 



(fiu) 



(p{u) + (p{v) 



where <p is an Archimedean generator and A is a dependence function of an EVC family. Notice 
that when ip = 1 we obtain the Euclidean family with generator ip and when ip(t) = —ln(t), we 
obtain the EVC family with dependence function A. Consider 

Mt) = In ^) and Ao,,p{t) = 1 - mm{at, (3{1 - t)}, 



for 6 G [—1,1) and a,/3 G I. Notice that pg generates the AMH family from Example 3.1 



while Aa,i3 generates the Marshall-Olkin family of copulas (Nelsen 2006; Mari and Kotz, 2001), 
comprehending copulas of the form 

Ca,piu,v) = mm{uv^~°',vu^~^}. 

Also notice that Aq^^ = Aafl = 1. Fixed (3q G (0, 1), the Archimax copula generated by pg and 
An fln can be written as 



(6>-i)to1-° -r ^ {e-\)v^ 

V a — [1 — 6^(1 — DjJ a 

{e-\)vu^-f'^ -f {e-\)v^ 

e)„,,i-/3o-[l-9(l-„)][l-6l(l-u)]i-/3oe'3o ' ^ ^ • 

I \ lii \ li ^ a — [1 — 6/(1 — d)J a 

Hence, Theorem |5.2[ does not apply since there is no set where the copula is twice differentiable 
on (a,0) for all (u, u) S P' . Notice that the support of the copula depends on both, the 
parameters (a,^) and the argument (ti,v). 



6 Applications 

In this section we present two applications of the theory developed in the previous sections. The 
first one is related to the parameter estimation in Gaussian time series models and the other is 
related to the simulation of time series with a given decay of covariance. 

We start by setting the mathematical framework. Let {Cg^e^Q be a family of parametric 
copulas, for C R with non-empty interior. Assume that there exists a point a G 0' such that 
lime_j.a Cg = n, where the limit is to be understood as the adequate lateral limit if a ^ int(0). 
Also assume that there exist a set D Q Q with non-empty interior such that a £ D' and Cg, 
seen as a function of the parameter 0, is of class in D. Let {^njneiN* be a sequence in D such 
that lim^-^oo dn = cl- Let {Xn}ne¥i be a weakly stationary process for which X„ is identically 
distributed with common absolutely continuous distribution F, for all n G W. Notice that the 



choice of copulas and F implies that Ki(n) and K2{n) in Theorem 5.1 are constants, provided 
they exist. Also assume that On — a = R{n,j), where 7) is a given measurable function 
incorporating Ki and satisfying R{n, 7) — )• 0, as n goes to infinity, and 7 G 5 C R is some 
(identifiable) parameter of interest. These assumptions make the framework compatible with 
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the assumptions of Theorem 5.1, from which we conclude that Cov{Xt, Xfj^-h) ~ R{h,^). 

In this context, in principle, estimating the parameter 7 is equivalent to estimating the 
particular parameterization {On}- Suppose we observe a realization (time series) xi, - ■ ■ , x„ from 
{Xn}n<aK as in the previous paragraph. Our goal is to estimate 7 based on these observations. 
To estimate the parameter 7, the theory developed in the last sections suggests the following 
simple methodology. 



1. We start by obtaining an estimate F of the underlying unknown marginal distribution 
function F. This can be achieved by any reasonable method. For instance, we could apply 
a parametric method by fitting a distribution to the data, or a non-parametric one, such 
as splines or any other kernel density estimator. 

2. With the estimated distribution function F at hand, we can obtain Ki and K2, which 
must be finite and Ki 7^ 0. We then form a new time series by setting m := F{xi), for 
i = 1, • • • , n. Notice that yi will lie on the unit interval. 

3. Next we form a bivariate time series {u]l''}^Z^ by setting u^^"^ := yj+i), i = 1, • • • , n — 1 
and by removing pairs containing or 1, if any. By Sklar's theorem, {it[.^^}^Zj can be 
regarded as a correlated sample from Cq. From these pseudo observations, 9 can be 
estimated by using any reasonable method. For instance, a maximum likelihood procedure 
or a nonparametric approach such as the inversion of Kendall's r or Spearman's p method 
could be applied. Let 61 denote the estimate obtained. Notice that — a is an estimate 
of i?(l,7)- 

4. Proceeding analogously for each s G {2, • • • ,m}, where m < n is the maximum desired 
lag of estimation, we form the sequence {'U'^^^}^'!^, by setting u^^"* := {yi,yi^s) (pairs 
containing are removed, if any), from which we obtain the estimate 9s- For each s, Og — a 
is an estimate of R{s,^). 

5. Let ^ : II x M.^ — J- [0,oo), k > 1, be a given function measuring the distance be- 
tween two vectors. For k G {I,-- - let 6^ '■= {61 — a, - - - ,6^ — a)"*" and Rkij) '-= 
(i?(l, 7), • • • , R{k, 7))"''. The estimator 7 of 7 is then defined as 

^ ni 

7:=— ^7fc, where 7^ := argmin{^(0fe, ^^(7)) }, k^l,---,m. 

^ k=i -^^^ 

In Step 1 , a histogram can be useful in determining a parametric distribution function candidate 
to fit the data. Alternatively, splines or some other kernel density estimator can be used to 
estimate F. In the case of a Gaussian process, a good estimate of F is by taking F normally 
distributed with mean and variance equal to the sample mean and sample variance, respectively. 
In Step 2, numerical integration can be applied to obtain Ki and K2 since, when they exist, 
they are usually very smooth functions. 

Remark 6.1. The estimates 9 obtained from the bivariate samples in Steps 3 and 4 above may 
be highly biased depending on the strength of the dependence in the process. For instance, 
the maximum likelihood procedure for copulas is consistent under some regularity conditions, 
which include an i.i.d. condition on the sample. Small departures from it still yield reasonable 
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estimates, but in the presence of strong dependence, the procedure may result in underestima- 
tion. In that case, some bias correction method can be apphed. See Subsections 6.1, 6.2 and 
16.31 below. 



In Step 4, the maximum desired lag m depends highly on the nature of the parameter 7. For 



instance, as we shall see in Subsections 6.1 and 6.2 for Gaussian MA(1) and AR(1) processes. 



m = 1 is a convenient choice for estimating the parameter of the process. As a rule of thumb, 
m should be a small fraction of n, since the bivariate correlated sample {u^™'''}^"^ has size 
n — m so that, in order to yield reasonable estimates, it is desirable that the difference n — m 
is reasonably large. Also removing pairs containing and 1 from is necessary because 

C(0, «) = C(u, 0) = 0, and C{l,u) = C{u, 1) = u for all u G I, for any copula C, so that the 
copula density function at these points is 0. From our simulations results, presented in the next 
subsections, when the memory parameter in a strongly dependent process is of interest, usually 
small m yields better results than large ones due to the cumulative bias in the estimation of 
Ok- For processes with weaker levels of dependence, larger ones yield slightly better results for 
the memory parameter than smaller ones, since the estimation bias problem in 6k is not so 
substantial. 

For u,v G R'"', k > 1, say u = • • • ,Uk)'^ and v = {vi, ■ ■ ■ ^VkY , usual choices for the 
function Si in Step 5 are 

= ^ |uj - Uil and Si{u,v) ='^^^{ui- Vif. 

i=l i=l 



In order to exemplify the methodology and assess its finite sample performance, we carry 
out some Monte Carlo experiments for common used time series models under a wide variety of 
dependence structures, from Markovian to strong long-range dependent ones. In all simulations 
we assume that the innovations are standard normally distributed. The marginal distribution 
F is always estimated by taking F to be normally distributed with mean and variance equal to 
the sample mean and sample variance of the time series, respectively. The underlying bivariate 
copulas are always assumed to belong to the Gaussian family and the parameter is estimated 
via the maximum likelihood procedure. 

The main task of simulating the time series and performing Steps 1 through 4 are executed 
by using the computational resources from the (Brazilian) National Center of Super Computing 
(CESUP-UFRGS). The routines are ah implemented in FORTRAN 95 language with OpenMP 
directives for parallel computing. Step 5 and post processing were performed by using the free 
statistical software R. 

6.1 Estimation on Gaussian MA(1) Processes 

Recall that a weakly stationary process {Xt}te-K is said to be an MA(g) process if, for all f G M, 

where 'd{z) := 1 + "Qiz + • • • + '&qz'i. When g = 1 we obtain an MA(1) process given by 
Xt = Zt + 'dZt-i- We shall assume that I"!?! < 1 for identifiability purposes. In the simulations 
we take cri = 1. 
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In the framework of the methodology explained in the beginning of this section, the parameter 
of interest in this application is 7 = "i?. Notice that in the notation of Example 



and pn = 0, for n > 1, so that we set m = 1. Also observe that assuming -i? G (—1,1), then 

pi e (-0.5,0.5) and ^ = ^^^0^. 

To assess the performance of the methodology, the following Monte Carlo experiment is 
carried out. We simulate Gaussian MA(1) processes of length n = 1,000 and coefficient 
■(? G {-0.9,-0.5,-0.1,0.1,0.5,0.9}. As mentioned before, the estimation is performed for a 
maximum lag m = 1 so that there is no need to specify a function S'. With the estimated cop- 

.la p— . p, ...e n.tu.a, idea is .o estimate , ,y ? := Howeve, ,o. parameter 

values near ±1, it may happen that the estimate pi falls outside the region (—0.5, 0.5) in which 
case -i? ^ R. A simple way to overcome this problem is the following. Given pi, we estimate "i? 
by ^ 

i9 := ^ ^ y^(^i) ^ where i'ix) := sign(a;) min{0.5, Ixl}, 

sVapi) 

where sign(x) = —1, if x < 0, and 1 otherwise. The experiment is replicated 1,000 times. 



Table 6.1 reports the mean estimated value and its mean square error (mse, in parenthesis). 



From Table 6.1 we observe that for parameter values ±0.1 and ±0.5, the procedure performs 
well producing estimates with both small bias and small mse. For = ±0.9 the estimates are 
still reasonable, but a greater bias and mse are observed compared to the other cases. This is 
due to many values of pi falling outside the region (—0.5, 0.5). 

Table 6.1: Estimation results for Gaussian MA(1) and AR(1) processes upon applying the methodology 



of Subsections 6.1 and 6.2 Presented are the mean estimated value along with the corresponding mean 



square error in parenthesis 



Parameter 


Model 


Parameter 


Model 


MA{1) 


AR(1) 


MA(1) 


AR(1) 


-0.9 


-0.8777 (0.0168) 


-0.8989 (0.0002) 


0.1 


0.0990 (0.0009) 


0.0973 (0.0011) 


-0.5 


-0.5063 (0.0031) 


-0.5011 (0.0008) 


0.5 


0.5048 (0.0031) 


0.4970 (0.0007) 


-0.1 


-0.1014 (0.0011) 


-0.1012 (0.0010) 


0.9 


0.8739 (0.0178) 


0.8971 (0.0002) 



6.2 Estimation on Gaussian AR(1) Processes 

Recall that a weakly stationary process {Xt}teK is a Gaussian AR{p) process if, for all t G W, 

ipiB)Xt = Zt, Zt^N{0,al), 
where (p{z) := 1 — Lpiz — ■ ■ ■ — ippZ^. When p = 1, we obtain the AR(1) process given by 

In this case the parameter of interest is 7 = (/?. The estimation is particularly simple because 
/9i = 99 so that we can set the maximum lag as m = 1. We perform the following Monte 
Carlo experiment. We simulate a Gaussian AR(1) process of length n = 1,000 and coefficient 
if G {-0.9,-0.5,-0.1,0.1,0.5,0.9}. As mentioned before, the estimation is performed for a 
maximum lag m = 1 so that there is no need to specify a function The value obtained, pi, 
is taken as the estimate of 99. We replicate the experiment 1,000 times. 

The mean estimated value and its mean square error (in parenthesis) are presented in Table 
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|6.1[ The results show that the methodology works very well, producing good estimates with 
fairly small mean square error. Notice that the higher the absolute value of the parameter, the 
more precise the estimation. 



6.3 Estimation on Gaussian ARFIMA(0, 0) Processes 

Recall that a Gaussian ARFIMA(j), d, q) process {Xt}teTN is a weakly stationary solution of the 
difference equations 

ifiB){l-BfXt = ^B)Zt, Zt^N{0,<jl), (6.1) 
where ip{z) := 1 — (piz — • • • — (/jpZ^ and 'd(z) := 1 + ^iz + • • • + ^qz'^ are polynomials assumed 



to have no common roots. It can be shown that for d G ( — 1, 0.5), the solution of (6.1 ) is causal 
and invertible, provided that the polynomials (p and do not have roots for \z\ < 1. In this case, 
{Xt}teTN will have an MA(cxd) representation Xt = J2keJN '^k^t-k^ ^oi all i G M, where {ck}keTN 
satisfies X^fcew '^1 < °° (^^^ Palma, 2007, p. 44 for more details). When p = q = 0, we obtain the 
ARFIMA(0, d, 0) process given by (1 - B)'^Xt = Zt. 

In the case of an ARFIMA(0, d, 0) the interest lies on estimating the memory parameter d, 



that is, 7 = d. The particular parameterization of the Gaussian copulas in this case is (5.2), 



as given in Example 5.1 In this case there is no obvious choice for the maximum lag m. Also, 
for large d, one can expect a significantly high bias in the parameter estimation of the copulas, 
which behaves cumulatively in performing Step 5 of the methodology as m increases. Therefore, 
a balance in the choice of m has to be searched for. Intuitively, one can expect to obtain better 
results with higher values of m when the dependence is weaker, and better results for small 
values of m in the presence of strong dependence. 

To assess the performance of the methodology proposed, we perform a Monte Carlo study by 
simulating Gaussian ARFIMA(0, d, 0) processes of length n = 1,000 for d E {-0.1,0.1,0.2,0.3, 
0.4,0.45}. The series were simulated by truncating the MA(oo) representation of the pro- 
cess. The truncation point is fixed at 50,000 for all cases. In the simulation we apply m G 
{5, 10, 25, 50, 100}. For each simulated time series and each m, we estimate the copula parame- 



ters following Steps 3 and 4 to obtain, with the notation of Example 5.1, pi, - ■ ■ ,Pm- For each 
k G {1, • • • , m}, we calculate 



cie(-0.5,0.5) 



^ m 



dk= aiguAn {—S"\ph-T{l- d)T{d)-^h'^'^-^\\. (6.2 



h=l 



The estimate of d is then dean '■= rn ^ YlT=i ^k- We replicate the experiment 1,000 times. Let 
us call the estimator obtained from (|6.2l) the canonical estimator. The mean estimated values 



along with their mean square errors are reported in the left side of Table 6.2 



In Table 6.2 we see that for d < 0.2, the methodology works well, but as d increases and 
the dependence among the variables become stronger, an increasingly significant bias in the 
estimates is observed. As for the value of m, we see that except for d = —0.1, m = 25 seems to 
produce overall the best results. However, for d > 0.3, the estimates are so biased that they are 
of no practical relevance. As we mentioned before, this high bias for high values of d is due to 
the bias on the estimation of the copula parameter based on strongly dependent data since, in 
this case, the maximum likelihood method greatly underestimates the parameter 6. 
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Table 6.2: Estimation results for ARFIMA(0, d, 0) processes upon applying the methodology of Sub- 
section 6.3 for lags m e {5, 10,25,50, 100} and by applying the canonical estimator (6.2 1 and the bias 
corrected estimator (|6.3|). Reported are the mean estimated value and its respective mean square error. 



d 


Canonical Estimator by Lag m 


d 


Bias Corrected Estimator by Lag m 


5 


10 


25 


50 


100 


5 


10 


25 


50 


100 


-0.10 


n 1 noR 

-U. -LUUO 

(0.0000*) 


-u. luuy 
(0.0000) 


-U. lUlU 

(0.0000) 


1011 

(0.0000) 


1011 
(0.0000) 


-0.10 


OQAS 

(0.0000) 


0Q4Q 

(0.0000) 


OQf^O 

(0.0001) 


OQ^l 

(0.0000) 


OQf^l 

(0.0000) 


0.10 


0.0979 
(0.0003) 


0.0982 
(0.0003) 


0.0986 
(0.0001) 


0.0985 

(0.0000) 


0.0983 
(0.0000) 


0.10 


0.1045 
(0.0007) 


0.1046 
(0.0006) 


0.1047 
(0.0002) 


0.1044 
(0.0000) 


0.1040 
(0.0000) 


0.20 


0.1867 
(0.0013) 


0.1874 
(0.0007) 


0.1878 
(0.0000) 


0.1875 
(0.0000) 


0.1866 
(0.0002) 


0.20 


0.2105 
(0.0031) 


0.2093 
(0.0020) 


0.2072 
(0.0003) 


0.2050 
(0.0001) 


0.2024 
(0.0000) 


0.30 


0.2636 
(0.0004) 


0.2652 
(0.0001) 


0.2664 
(0.0006) 


0.2661 
(0.0008) 


0.2640 
(0.0014) 


0.30 


0.3118 
(0.0022) 


0.3080 
(0.0013) 


0.3021 
(0.0000) 


0.2971 
(0.0001) 


0.2913 
(0.0003) 


0.40 


0.3307 
(0.0006) 


0.3325 
(0.0011) 


0.3337 
(0.0044) 


0.3333 
(0.0048) 


0.3314 
(0.0056) 


0.40 


0.4058 
(0.0002) 


0.3966 
(0.0002) 


0.3856 
(0.0015) 


0.3774 
(0.0017) 


0.3688 
(0.0023) 


0.45 


0.3599 
(0.0024) 


0.3617 
(0.0034) 


0.3628 
(0.0084) 


0.3624 
(0.0089) 


0.3603 
(0.0092) 


0.45 


0.4468 
(0.0001) 


0.4355 
(0.0006) 


0.4220 
(0.0036) 


0.4124 
(0.0040) 


0.4028 
(0.0042) 



Note *: 0.0000 means that the mean square error is smaller than 5 X 10 ^. 



To overcome this deficiency, we propose the following bias correction procedure. We follow 



the same steps as before but we substitute (6.2) by the following optimization procedure 



^ m 



dk := argmin <j - V | P/, - T{d)-^h^'^-^ }, (6.3) 
de(-0.5,0.5) ["T- ^ J 

and we set dcor '■= X^fcLi dk just like before. Let us call this the bias corrected estimator. 



Notice that the difference between (6.2) and (6.3) is the absence of r(l — d) in the latter. The 



motivation for such correction lies on the behavior of r(l — d) as shown in Figure |6.1[ a). For 
small values of d, r(l — d) assumes values close to 1, so that the good canonical estimates should 
not change much, but for large values of d, omitting r(l — d) produces better estimates. 



The results for the bias corrected estimator are presented in the right side of Table 6.2 



From Table 6.2, for d < 0.2, the results for the bias corrected estimator have slightly worse 
performance than the canonical ones, but are still quite good. For higher values of d, however, 
the bias corrected estimator is far superior, especially for small lags. In Figure |6.1[ b) and 
Figure [6rT| c), we show the evolution of the estimated values for both estimators when d = 0.1 
and d = 0.45, respectively. Notice that for small values of d, the estimated value improves as 
the lag increases while for higher values of d, the opposite happens. As explained before, this is 
mainly due to the cumulative effect of the bias in estimating the copula parameter under strong 
dependence. 



6.4 Application to Simulation of Time Series 

In this subsection we present an application of the results in the previous sections to simulation 
of time series with a given decay of covariance. The main idea is to observe that a realization 
xi, • • • ,Xn from any process {Xt}tew can be seen as a random variate from the joint distribution 
of {Xi, ■ ■ ■ ,Xn)- That is, if for a given process, {Xt}teK*i we are able to determine the joint 
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Behavior of r(1 -d) 



Behavior of the estimates by iag when d = 0.1 Behavior of the estimates by lag when d = 0.45 




■0.4 -0.3 -0.2 -0.1 0.0 0.1 0.2 0.3 0.4 



(a) 



Canonical 
Bias Corrected 



(b) 



Canonical 

— Bias Corrected 



(c) 



Figure 6.1: (a) Plot of r(l — d) for d G (—0.45,0.45). Behavior of the estimated values of d by the 
canonical and bias corrected estimators for (b) d = 0.1 and (c) d = 0.45. 



distribution of {Xi, ■ ■ ■ , say H, and we are able to generate random variates from H, then 
each single variate from H is & size n realization of the respective process. 

With this in mind, consider the problem of simulating a time series, say xi, • • • ,Xn, with 
a given decay of covariance, say n~", for a prescribed a > 1, with fat tailed marginals which 
vary with the time. This is certainly a non-trivial task, but we shall show how to approach the 
problem by using the theory developed in the previous sections. 

Consider the following particular case of the construction presented in Example|2.1[ Let 9^ '■= 



Kq k A; G M*, for a > 1 and kq > ({a), and consider the sequence of copulas {C0^}keJM* from 

— 1/2 

the FGM family. Let {ak}k&¥i be an arbitrary sequence of real numbers and let b := ln(2)KQ 
Let {-FfcjfcgiN be a sequence of distribution functions distributed as EVI(afc,6), for each A: G W. 



From Example 2.1, if the copula related to Xt and X^^h is Cg^, and Xt is distributed according 
to Ft, then 

Cov{Xt,Xt+h) = h^"', for alH e IN and h e W . 
From this point we observe that the task will be accomplished if we can determine an n- 



G 



for i,j £ 



dimensional copula for which the marginals are the (2) copulas Cij 
{I,-- - ,n}, i 7^ j. For instance, proceeding as in Example 2.1, one can construct the n- 



dimensional copula (2.2). If we are able to produce a random variate from this n dimen- 
sional copula, say ui, • • • ,Un, by Sklar's theorem the desired time series is obtained by setting 



Xk 



,n. 



Remark 6.2. A word of caution is in time. Because of the compatibility problem, an n- 
dimensional copula satisfying the conditions imposed in the Example |2.1[ may not exist, so one 
has to be careful in the calculations. 



Simulation of commonly applied Gaussian time series with a given decay of covariance is an 
easy task through the theory developed in the previous sections. Recall that the n-dimensional 
Gaussian copula is given by 



Co(mi 



,u„) :=$fi($ ^(ui),--- ^(w„)), 



where Q = is a symmetric positive definite real matrix with diag(r2) = (I,-- - , l)""" 

and denotes the multivariate normal distribution function with variance- covariance matrix 
Q. It is clear that all bivariate marginals are also Gaussian copulas. Let Xi, • • • , Xn be random 
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variables for which the associated copula is an n-dimensional Gaussian copula Cq. Then the 
copula related to Xr and Xs is a Gaussian copula with parameter p = Jl^s- In order to simulate 
a Gaussian time series, therefore, we basically need to correctly specify the matrix Q. Once 0, 
is specified, we obtain a single random variate from the respective Gaussian copula and apply 
^>~^ to it. The result is the desired time series. 



For instance, referring to Example 5.1 to simulate a sample of size n from an MA{q) process 



with coefficients {'dk}1=Qj we construct the matrix 17 by taking 



^ ??fe??fe+|j_j+i|(5(|?-j + l| < g), i,j ^l,--- ,n, i^j. 



fc=0 



A sample of size 400 observations from an MA(1) process with polynomial '&{z) = 1 + 0.3z 



is shown in Figure a) along with the estimated autocorrelation function in Figure 6.2 d). 
Applying the methodology of Subsection 6.1, we obtain the estimated value "i? = 0.3052. 



MA{1) time series 



AR(1) time series 



ARFiMA(0, 0.4,0) time series 




(a) 

ACF of an MA(1) process 



■||ir'||-||--l|; 



10 20 



Lag 



(d) 





(b) 



ACF of an AR(1) process 




(c) 



ACF of an ARFiMA(0,0.4,0) process 



T 'ii j] ii j i'' H : '' ;; i r ' i M[; ' 



40 50 



30 40 



Lag 




(e) 



(f) 



Figure 6.2: Simulated time series {xt}"^i obtained by using the method of parameterization corre- 
sponding to an (a) MA(1); (b) AR(1), both with n = 400; and (c) ARFIMA(0,0.4,0) with n = 1,000. 
Estimated autocorrelation functions for lag 1 to 50 for (d) the MA(1) and (e) the AR(1) processes and 
for lag 1 to 200 for (f) the ARFIMA(0,0.4,0). 



Referring to Example 5.1 to obtain a realization of an AR(1) process, with coefficient tp, 
we construct the matrix $7 by taking = ip\^~^\. A sample of size 400 observations from 
an AR(1) process with polynomial ip{z) = 1 — 0.3z is shown in Figure |6.2| ^b) along with the 
estimated autocorrelation function in Figure |6.2[e) . Applying the methodology of Subsection 



6.2 we obtain the estimated value tp = 0.2963. 

Still referring to Example |5.1[ to obtain a sample of size n observations from a Gaussian 
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ARFIMA(0, d, 0) process, we set 



^ Y-r k - 1 + d 

= [[ — = 7^ 



k=l 



k-d 



The other expression in (5.2) works as weh. A sample of size n = 1,000 observations from 
an ARFIMA(0, d, 0) process for d = 0.4 is shown in Figme |6.2[c) along with the estimated 



autocorrelation function in Figure [6^ f). Applying the methodology of Subsection 6.3, for lag 
25, the canonical estimator (can) yields dean = 0.3514 and the bias corrected estimator (cor) 
yields 4or = 0.4080. 



7 Real Data Application 

In this section we apply the estimation methodology of Section [6] to the daily returns of the 
S&PSOO US stock market index in the period from 01/03/2000 to 11/03/2011, which gives a 
sample size n = 2,980. Figure [TlTI a) to Figure [7?T| c) present the S&:P500 time series, the 
correspondent returns {rt}'^^i and the absolute returns respectively. 

Among the stylized facts about financial time series related to stock market indexes, one 
often observes that the returns are uncorrelated while the absolute and squared returns are 
correlated and present slow decay of covariance, usually proportional to for /3 G [0.2, 0.4] in 
both cases (see Cont, 2001 and references therein). The study of the absolute and square returns 
are of great importance since they contain information about the (unobservable) volatility. Our 
goal is to apply the methodology developed in Section [6] to estimate /3. 



S&P500 index from 01/03/2000 to 11/03/2011 S&P500 returns from 01/03/2000 to 11/03/2011 Absolute returns of the S&P500 series 




(a) (b) (c) 

Figure 7.1: S&P500 (a) original time series; (b) return time series; and (c) absolute return time series. 

We shall work with the absolute return time series only, since due to the invariance of copulas 
to increasing transformations (notice that is a strictly increasing function for x £ [0, oo)), the 
underlying bidimensional copulas are the same in both cases, and, as long as F is well estimated, 
the conclusions in both cases should be basically the same. Notice that this is coherent with 
the stylized facts. We shall assume that the underlying process is strongly stationary and 
ergodic and that all underlying bivariate copulas belong to the same one parameter copula 
family parameterized in the spirit of the theory developed in Section [5] 

Figure |7.2[ a) presents the autocorrelation function and the periodogram of the absolute re- 
turn time series of the S&:P500 index. The slow decay of the sample autocorrelation function and 
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the pronounced peak at the zero frequency in the periodogram function both suggest long-range 
dependence on the absolute return time series. Sometimes slow decay on the autocorrelation 
function may be due to non-stationarity. To rule this hypothesis out, a Phillips-Perron unit root 
test (Phillips and Perron, 1988) can be performed on \rt\. The test rejects the null hypothesis of 
unit root at 5% confidence level, with p-value = 0.01. The conclusion reinforces the hypothesis 
of long-range dependence. 



oo 
t=0 



Given the behavior of the sample autocovariance function, it is often assumed that {\rt\} 
follows an ARFIMA(p, d, q) model, so that estimation of d is of interest. In this line, one has 
several estimator at disposal. See, for instance, the review in Lopes (2008) for classical and 
recent developments, Lopes et al. (2004) for the non-stationary ARFIMA(p, d, q) case and the 
recent method developed in Lopes et al. (2011). See also Palma (2007). 

To apply the methodology of Section |6j we shall first estimate the underlying marginal distri- 
bution of the data. Figure [7^ c) presents the histogram of \rt\, which suggests an Exponential 
distribution. 



ACF of |r,| 




Periodogram of |rt| 



Histogram of |rt| 




20 40 



80 100 



(a) 



(b) 



(c) 



Figure 7.2: (a) Autocorrelation function; (b) periodogram and (c) histogram of the S&P500 absolute 
return time series and the fitted Exponential density. 



By using a maximum likelihood approach, we fit an Exp(10.688) distribution, whose density 



is shown in Figure 7.2 c). We then define F ~ Exp(10.688) as the estimator for the underlying 



marginals and set yi = F{\rt\), t = 1, - ■ ■ , 2979. 

Next step is to identify the underlying family of bidimensional copulas related to the process. 
One way to do this is by plotting {iyi,yi+s)}'i=i ^^ for some values of s. The plots are a helpful 
tool to identify the underlying copulas by observing some key dependence features such as tail 



dependence, singular components, dependence strength, etc. In Figure 7^ we have done so for 
lags s G {1, 5, 25, 50, 100}. The plots suggest that the underlying family of copulas comprehend 
absolutely continuous ones, with relatively weak dependence for each lag and with no tail depen- 



dence. Furthermore, in view of Figure 7.3, it is reasonable to assume the bidimensional copulas 
to be Gaussian ones. 

For the Gaussian copula with Exp(10.688) marginals, Ki and K2 can be shown to be finite 
with values approximately equal to 260.47 and 40.53, respectively. The next step is to estimate 



the copula parameter p from the pseudo samples {iyi,yi+s)}'i=i~'^ ■ Figure 7.4 'a) presents the 
estimated copula parameter obtained via maximum likelihood method up to lag 400. Notice 
that the very first estimate appears distant from the cloud of estimates. Chosen a particular lag 
m, let pi, pm denote the estimates obtained in the previous step. For comparison purposes, 
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we perform the following three optimization procedures 



f 1 

Pgik) :=argmin 

/3G(0,1) 

A'eIR\{0} 



(7.1) 



dcor{k) 



argmin {\y^\p, -T{1 ~ d)T{d)-H"'-^\\ , /3ea«(fc) 1 - 24,„(fc), (7.2) 

E(-0.5,0.5) Ik ^ J 

argmin [\y^\p.^ -T{d)-H^''-^\\ , f3,or{k) := 1 - 2d,or{k), (7.3) 

E(-0.5,0.5) Ik ^ J 



for /c = 1, • • • , m. In all cases, the final estimate, denoted by /3g, /3can and ficor-, respectively, is 
defined as 



EPg{k)\ Pcan —y^Bcan{k) and Pcor -"^ —^^^ Pcor{k) ■ 
m ' ^ Til ' ^ 



(7.4) 



k=l 



k=l 



fc=l 



For estimating f3g, the particular value of the constant K is of no interest, but it is necessary 
in the estimation since we are assuming that the decay of covariance is proportional to n~^. 
The estimators dean and dcor are the canonical and the bias corrected estimators, respectively, 
introduced in Subsection |6.3| which are transformed in order to reflect a decay proportional 
to n~^. In this case we are tacitly assuming an ARFIMA(0, d, 0) model with exponentially 
distributed marginals. 




0.0 0.2 0.4 0.6 




Pseudo-sample of the lag 25 copula 




0.0 0.2 0.4 0.6 



(a) 



(b) 



(c) 



Pseudo-sample of the lag 50 copula 



Pseudo-sample of the lag 100 copula 




0.0 0.2 0.4 0.6 0.8 1.0 




0.0 0.2 0.4 0.6 0.8 1.0 



(d) 



(e) 



Figure 7.3: Plot of {{y„y,+,)}fl['''''' for (a) s = 1; (b) s = 5; (c) s = 25; (d) s = 50 and (e) s = 100. 



Since in the literature of ARFIMA process, the covariance decay is taken proportional to 
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n 



2d-l 



, for d G (0,0.5), and here we are studying a decay proportional to n ^, for /3 G (0, 1), 
when appropriate, we shall present the results in both scales. Notice that f3 and d are related 
by /3 = 1 - 2d. 



Figure 7.4 ^b) presents the sequences (3g{k), fican{k) and Pcor{k), given respectively by (7.1), 
1, • • • , 400. We notice that for small lags, all estimators vary rapidly, but 
7.4 'c) presents the estimators f3g, /Scan and /3, 



( |7.2D and (jTJ), for k 

become smooth after 50 lags or so. Figure 
1, 



by (7.4), for m 



given 



,400. In Figure 7.4 'b) and Figure |7.4[c), the horizontal lines represent 



the interval [0.2,0.4]. 



Estimated copula parameter by lag 



Estimates as a function of k 



Pg(k) • Pc,„(k) • Pc„r(k) 



Evolution of the final estimates by lag 




200 

k 



(a) 



(b) 



(c) 



Figure 7.4: (a) Copula parameter estimated values; (b) Estimated $g{k), jican{k) and Pcorik) given by 
( 7.1 1, ( 7.2 ) and ( |7.3| , respectively, for fc = 1, • • • , 400; (c) Evolution of the estimators /3g, Pcan and Pcor 
given by (7.4), for m = 1, • • • ,400. 



Table 7.1 presents the estimated value (3g, (Scan and jicor for lags 5, 10, 25, 50, 100 and 400. 
The estimated value of [3g takes long to stabilize and, for small lags, this estimate is clearly off. 
This behavior is due to the flexibility in performing a two parameter optimization, which can 
lead to extremely bad estimates for small number of observations. However, as the lag increases, 
fig stabilizes and takes values close to the other estimators. 

Table 7.1: Estimated /3g, Pcan and Pcor values for lag m G {5, 10, 25, 50, 100, 400} for the absolute return 
time series of the S&P500 index. Also presented are d^an and d^or (in parenthesis). 



Estimator 


Lag m 


5 


10 


25 


50 


100 


400 




-0.2266 


-0.1442 


-0.0200 


0.0549 


0.1151 


0.2755 




0.6216 


0.5397 


0.4509 


0.4081 


0.3827 


0.3682 


{dean ) 


(0.1892) 


(0.2301) 


(0.2746) 


(0.2959) 


(0.3087) 


(0.3159) 




0.5747 


0.4798 


0.3833 


0.3424 


0.3182 


0.3152 


{dcor ) 


(0.2127) 


(0.2601) 


(0.3083) 


(0.3288) 


(0.3409) 


(0.3424) 



Comparing the values obtained for dean and dcor in Table 

we expect considerable bias in d, 
is obtained for m 



6.2 



7.1 



timated values in Table 
for dcor-, according to Subsection 



(in parenthesis) and the es- 
can- The most accurate estimate 



6.3 



= 25, that is, dcor = 0.3083 with 
0.3834 with 95% confidence interval 



95% confidence interval (0.3038, 0.3140) which gives j3ca 
(0.3720,0.3924). The quantiles and standard deviation necessary to obtain the confidence in- 
terval were obtained by using the so-called stationary bootstrap method (Politis and Romano, 
1993) with random blocks of length geometrically distributed with mean 1,000. We perform 
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1,000 bootstrap replicates (see also Davison and Hinkley, 1997). Consistently, considering the 
value of dean obtained, an inverse search on the results for the canonical estimator in Table 
places d near 0.30, or equivalently, (3 close to 0.40. 



6.2 



8 Conclusions and Final Remarks 

In this work, we study the problem of constructing stochastic processes with a predetermined 
decay of covariance by parameterizing a family of copulas and the process' marginals. Although 
the main interest in practice lies on stationary processes, the theory proposed here covers both 
stationary and non-stationary processes and allow for arbitrary decay of covariance. We present 
several examples to illustrate the theory, including the widely applied Euclidean, Extreme Value 
and Gaussian family of copulas. We show how the theory blends with some common applied 
time series models such as the large class of ARFIMA processes. 

To show the usefulness of the theory, we apply it to the problem of estimation in time series 
models. We develop a general methodology of parameter estimation identifiable through the 
covariance decay of the process based on the theoretical results obtained in the paper. Examples 
are provided for better understanding the technique. 

To assess the methodology's finite sample performance, we perform some Monte Carlo sim- 
ulations. The simulations reinforce the competitiveness of the methodology, as well as illustrate 
its use. Also motivated by the theory developed in the paper, we present an application to 
simulation of time series with given decay of covariance and marginals by parameterizing the 
bidimensional copulas of the process. The method allows to obtain realizations of non-standard 
time series in a fast and relatively simple way. 

We finally show how to apply the methodology by analyzing the S&:P500 stock market index 
time series. We use the methodology to study the covariance decay of the absolute (and for 
consequence, of the squared) S&:P500 return time series. We conclude that the methodology 
yield results coherent with the stylized facts of financial time series. 
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